Lexicon of Common Scientific Words and Expressions for Automatic Discourse Analysis of Scientific and Technical Texts
نویسنده
چکیده
Various NLP applications require automatic discourse analysis of texts. For analysis of scientific and technical texts, we propose to use all typical lexical units organizing scientific discourse; we call them common scientific words and expressions, most of them are known as discourse markers. The paper discusses features of scientific discourse, as well as the variety of discourse markers specific for scientific and technical texts. Main organizing principles of a computer dictionary comprising common scientific words and expressions are described. Key ideas of a discourse recognition procedure based on the dictionary and surface syntactical analysis are pointed out.
منابع مشابه
Language Features of Russian Texts of Engineering Discourse
The Article is devoted to the applied problem of identifying the linguistic features of engineering texts. The study of Russian-language texts of engineering discourse is usually of an applied nature, in our case, this applied research is caused by the need to teach foreigners who receive professional engineering education in Russia and in Russian language. The object of the research is the Rus...
متن کاملTesting Problems in Russian as a Foreign Language in a Technical University
Problems of theory and practice of the Russian as a foreign language testing for entrants in technical universities are considered. The benefits of test forms for controlling the foreign students’ skills in the Russian language during a hard time limit are presented. The structure and content of the tests, all types of tasks offered on the entrance and final examinations in the Russian languag...
متن کاملA Comparative Study of Ideational Grammatical Metaphor in Scientific and Political Texts
Language, science and politics go together and learning these genres is to learn a language created for codifying, extending and transmitting scientific and political knowledge. Grammatical metaphor is divided into two broad areas: ideational and interpersonal.This paper focuses on the first type i.e. Ideational Grammatical Metaphor (IGM), which includes process types and nominalization. The m...
متن کاملA Discourse Structure Analysis of Technical Japanese Texts and Its Implementation on the WWW*
This paper deals with a discourse structure analysis of technical Japanese texts for developing a Japanese writing Computer Assisted Language Learning (CALL) system whose goal is to assist students in learning to write technical Japanese texts. To analyze discourse structures of technical Japanese texts, cohesive expressions are used as cue words. The rules for analyzing texts are based on micr...
متن کاملA Linguistic Analysis of Conference Titles in Applied Linguistics
Over the past twenty-five years, researchers have expressed considerable interest in titles of academic publications. Unfortunately, conference paper titles (CPTs) have only recently begun to receive attention. The aim of this study, therefore, is to investigate the text length, syntactic structure, and lexicon of CPTs in Applied Linguistics. A data set of 698 titles was selected from the 2008 ...
متن کامل